11 research outputs found
Invariant Models for Causal Transfer Learning
Methods of transfer learning try to combine knowledge from several related
tasks (or domains) to improve performance on a test task. Inspired by causal
methodology, we relax the usual covariate shift assumption and assume that it
holds true for a subset of predictor variables: the conditional distribution of
the target variable given this subset of predictors is invariant over all
tasks. We show how this assumption can be motivated from ideas in the field of
causality. We focus on the problem of Domain Generalization, in which no
examples from the test task are observed. We prove that, in an adversarial
setting, using this subset for prediction is optimal in Domain Generalization;
we further provide examples in which the tasks are sufficiently diverse that
the estimator outperforms pooling the data, even on average. If
examples from the test task are available, we also provide a method to transfer
knowledge from the training tasks and exploit all available features for
prediction. However, we provide no guarantees for this method. We introduce a
practical method which allows for automatic inference of the above subset and
provide corresponding code. We present results on synthetic data sets and a
gene deletion data set.
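As a concrete illustration of what the automatic subset search could look like, here is a minimal sketch; it assumes linear Gaussian models and uses simple mean/variance tests on pooled residuals as a stand-in for the paper's actual invariance test:

```python
# Sketch of invariant-subset search across tasks. Assumptions (not the
# paper's exact procedure): linear models, and residual mean/variance
# tests as a proxy for testing equality of conditional distributions.
from itertools import combinations

import numpy as np
from scipy import stats
from sklearn.linear_model import LinearRegression

def residuals_invariant(X, y, task, subset, alpha=0.05):
    """Pooled fit on `subset`; test if residuals look identical across tasks."""
    Xs = X[:, subset]
    res = y - LinearRegression().fit(Xs, y).predict(Xs)
    groups = [res[task == t] for t in np.unique(task)]
    _, p_mean = stats.f_oneway(*groups)  # equal residual means across tasks?
    _, p_var = stats.levene(*groups)     # equal residual variances?
    return min(p_mean, p_var) > alpha

def find_invariant_subsets(X, y, task, alpha=0.05):
    """Return all feature subsets whose conditional appears task-invariant."""
    accepted = []
    for k in range(1, X.shape[1] + 1):
        for subset in combinations(range(X.shape[1]), k):
            if residuals_invariant(X, y, task, list(subset), alpha):
                accepted.append(subset)
    return accepted
```

Prediction in an unseen test task would then use only an accepted subset, since its conditional distribution is assumed to carry over.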
Learning Independent Causal Mechanisms
Statistical learning relies upon data sampled from a distribution, and we
usually do not care what actually generated it in the first place. From the
point of view of causal modeling, the structure of each distribution is induced
by physical mechanisms that give rise to dependences between observables.
Mechanisms, however, can be meaningful autonomous modules of generative models
that make sense beyond a particular entailed data distribution, lending
themselves to transfer between problems. We develop an algorithm to recover a
set of independent (inverse) mechanisms from a set of transformed data points.
The approach is unsupervised and based on a set of experts that compete for
data generated by the mechanisms, driving specialization. We analyze the
proposed method in a series of experiments on image data. Each expert learns to
map a subset of the transformed data back to a reference distribution. The
learned mechanisms generalize to novel domains. We discuss implications for
transfer learning and links to recent trends in generative modeling.
Comment: ICML 201
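A minimal sketch of one competition step follows, assuming MLP experts, a single GAN-style discriminator that scores closeness to the reference distribution, and flattened image vectors; all names and hyperparameters are illustrative:

```python
# Sketch of the expert-competition step (assumptions: MLP experts, one
# discriminator D giving the probability a sample is from the reference
# distribution, as in a standard GAN).
import torch
import torch.nn as nn

def mlp(din, dout):
    return nn.Sequential(nn.Linear(din, 128), nn.ReLU(), nn.Linear(128, dout))

dim, n_experts = 784, 5
experts = [mlp(dim, dim) for _ in range(n_experts)]
D = nn.Sequential(mlp(dim, 1), nn.Sigmoid())
opt_e = [torch.optim.Adam(e.parameters(), lr=1e-4) for e in experts]
opt_d = torch.optim.Adam(D.parameters(), lr=1e-4)

def competition_step(x_transformed, x_reference):
    # Each expert proposes an inverse mapping; D scores the proposals.
    outs = torch.stack([e(x_transformed) for e in experts])   # (E, B, dim)
    scores = torch.stack([D(o).squeeze(-1) for o in outs])    # (E, B)
    winner = scores.argmax(dim=0)                             # (B,)
    # Only the expert that wins a sample receives a gradient on it,
    # which is what drives specialization to one mechanism per expert.
    for i, (e, opt) in enumerate(zip(experts, opt_e)):
        mask = winner == i
        if mask.any():
            loss = -torch.log(D(e(x_transformed[mask])) + 1e-8).mean()
            opt.zero_grad(); loss.backward(); opt.step()
    # Discriminator update: reference samples real, expert outputs fake.
    d_loss = -(torch.log(D(x_reference) + 1e-8).mean()
               + torch.log(1 - D(outs.detach().flatten(0, 1)) + 1e-8).mean())
    opt_d.zero_grad(); d_loss.backward(); opt_d.step()
```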
Learning Transferable Representations
A first contribution of this thesis is to propose causality as a language for problems of distribution shift.
First, we consider domain generalisation, where no data from the test distribution are observed during training. What assumptions can be made regarding the relation between train and test
distributions for transfer to succeed? We argue that assuming the data in both tasks originate from the same causal graph leads to a natural solution: use only causal features for prediction, as the mechanism mapping causes to effects is invariant to the distribution shifts induced by the causal structure. We provide optimality results when the test task is adversarial, and introduce a method for exploiting all remaining features when data from the test task are observed. We further argue that learning such invariant mechanisms from features to outputs yields machine learning modules that remain robust under transfer.
Second, we consider a classification problem where only few examples are available for each label. How should an initial large dataset be leveraged to improve performance in this task? We argue that such a dataset should be used to learn powerful features for batch classification using a neural network. We present a framework which transfers between classes by building a probabilistic model
on the weights of the network. Our results suggest that practitioners should use the original dataset for building features whose power can be exploited during few-shot learning.
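A minimal sketch of this two-stage recipe, assuming features have already been extracted by a network trained on the large dataset, and using a Gaussian over the base-class softmax weights as a prior (MAP estimation stands in for the full probabilistic treatment):

```python
# Sketch: probabilistic transfer through the classifier weights.
# Assumptions: `feats` are frozen features from the pretrained network;
# `W_base` are the softmax weights learned on the base classes.
import numpy as np
from scipy.optimize import minimize

def fit_weight_prior(W_base):                 # W_base: (n_base_classes, d)
    mu = W_base.mean(axis=0)
    cov = np.cov(W_base.T) + 1e-3 * np.eye(W_base.shape[1])
    return mu, np.linalg.inv(cov)

def map_few_shot_weights(feats, labels, n_classes, mu, prec):
    """MAP softmax weights for novel classes under the Gaussian prior."""
    d = feats.shape[1]
    def neg_log_post(w_flat):
        W = w_flat.reshape(n_classes, d)
        logits = feats @ W.T
        logits -= logits.max(axis=1, keepdims=True)
        log_p = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
        nll = -log_p[np.arange(len(labels)), labels].sum()
        prior = 0.5 * sum((w - mu) @ prec @ (w - mu) for w in W)
        return nll + prior
    w0 = np.tile(mu, n_classes)               # start each class at the prior mean
    return minimize(neg_log_post, w0, method="L-BFGS-B").x.reshape(n_classes, d)
```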
Finally, we extend causal discovery to problems such as distinguishing a painting from its counterfeit. Given two such static entities, a proxy random variable introduces the randomness necessary to construct two features of the static entities that preserve their causal footprint, measurable by a standard causal discovery procedure. Experiments on vision and language provide evidence that the causal relation between the static entities can often be identified.
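One way the proxy construction could look in code (a sketch, not the thesis' exact procedure): random patch locations play the role of the proxy variable, patch means are the constructed features, and an additive-noise-style test with HSIC stands in for whichever bivariate causal discovery method one prefers:

```python
# Sketch of proxy-variable causal discovery between two static images.
# Assumptions: patch locations as the proxy, patch means as features,
# HSIC residual-independence as the direction criterion.
import numpy as np
from sklearn.neighbors import KNeighborsRegressor

def hsic(a, b, sigma=1.0):
    """Simple biased HSIC estimate with RBF kernels on 1-D variables."""
    n = len(a)
    K = np.exp(-np.subtract.outer(a, a) ** 2 / (2 * sigma ** 2))
    L = np.exp(-np.subtract.outer(b, b) ** 2 / (2 * sigma ** 2))
    H = np.eye(n) - np.ones((n, n)) / n
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

def anm_direction(A, B):
    """'A->B' if residuals of B given A look more independent than the reverse."""
    def residual_dependence(x, y):
        x = (x - x.mean()) / (x.std() + 1e-12)
        y = (y - y.mean()) / (y.std() + 1e-12)
        r = y - KNeighborsRegressor(10).fit(x[:, None], y).predict(x[:, None])
        return hsic(x, r)
    return "A->B" if residual_dependence(A, B) < residual_dependence(B, A) else "B->A"

def proxy_features(img_x, img_y, n=500, patch=8, rng=np.random.default_rng(0)):
    """Sample the same random patch locations from both images."""
    h, w = img_x.shape
    rs = rng.integers(0, h - patch, n); cs = rng.integers(0, w - patch, n)
    A = np.array([img_x[r:r+patch, c:c+patch].mean() for r, c in zip(rs, cs)])
    B = np.array([img_y[r:r+patch, c:c+patch].mean() for r, c in zip(rs, cs)])
    return A, B
```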
Avoiding Discrimination through Causal Reasoning
Recent work on fairness in machine learning has focused on various
statistical discrimination criteria and how they trade off. Most of these
criteria are observational: They depend only on the joint distribution of
predictor, protected attribute, features, and outcome. While convenient to work
with, observational criteria have severe inherent limitations that prevent them
from resolving matters of fairness conclusively.
Going beyond observational criteria, we frame the problem of discrimination
based on protected attributes in the language of causal reasoning. This
viewpoint shifts attention from "What is the right fairness criterion?" to
"What do we want to assume about the causal data generating process?" Through
the lens of causality, we make several contributions. First, we crisply
articulate why and when observational criteria fail, thus formalizing what was
before a matter of opinion. Second, our approach exposes previously ignored
subtleties and why they are fundamental to the problem. Finally, we put forward
natural causal non-discrimination criteria and develop algorithms that satisfy
them.
Comment: Advances in Neural Information Processing Systems 30, 2017
http://papers.nips.cc/paper/6668-avoiding-discrimination-through-causal-reasonin
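To make the causal viewpoint concrete, here is a hedged toy example in a linear structural equation model; the graph (protected attribute -> proxy -> feature -> predictor) and all coefficients are illustrative, not the paper's notation or algorithm:

```python
# Toy linear SEM illustrating proxy discrimination and one way to cancel
# it: make the predictor invariant under interventions on the proxy.
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
A = rng.integers(0, 2, n).astype(float)    # protected attribute
P = 1.5 * A + rng.normal(size=n)           # proxy (e.g., name, zip code)
X = 2.0 * P + rng.normal(size=n)           # feature influenced by the proxy
w = 0.7                                    # predictor weight on X

def predictor_naive(X, P):
    return w * X                           # inherits the proxy's influence

def predictor_adjusted(X, P, beta=2.0):
    # Subtracting the proxy's linear contribution makes the predictor
    # invariant under do(P): its mean no longer moves with the proxy.
    return w * (X - beta * P)

for do_p in (0.0, 3.0):                    # intervene on the proxy
    X_do = 2.0 * do_p + rng.normal(size=n)
    print(do_p,
          predictor_naive(X_do, do_p).mean(),
          predictor_adjusted(X_do, np.full(n, do_p)).mean())
# The naive mean shifts with do(P); the adjusted mean stays near 0 for both.
```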
Getting personal with epigenetics: towards individual-specific epigenomic imputation with machine learning
Epigenetic modifications are dynamic mechanisms involved in the regulation of gene expression. Unlike the DNA sequence, epigenetic patterns vary not only between individuals, but also between different cell types within an individual. Environmental factors, somatic mutations and ageing contribute to epigenetic changes that may constitute early hallmarks or causal factors of disease. Epigenetic modifications are reversible and thus promising therapeutic targets for precision medicine. However, mapping efforts to determine an individual's cell-type-specific epigenome are constrained by experimental costs and tissue accessibility. To address these challenges, we developed eDICE, an attention-based deep learning model that is trained to impute missing epigenomic tracks by conditioning on observed tracks. Using a recently published set of epigenomes from four individual donors, we show that transfer learning across individuals allows eDICE to successfully predict individual-specific epigenetic variation even in tissues that are unmapped in a given donor. These results highlight the potential of machine learning-based imputation methods to advance personalized epigenomics.
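As an illustration of the general idea (not eDICE's actual architecture), a minimal attention-based imputer could build a query from the target track's cell-type and assay embeddings and attend over the observed tracks at the same genomic bin:

```python
# Minimal sketch of attention-based track imputation. Assumed interface:
# per-bin signal values for observed tracks, each tagged with cell type
# and assay; the missing track is queried by its own tag embeddings.
import torch
import torch.nn as nn

class TrackImputer(nn.Module):
    def __init__(self, n_cells, n_assays, d=64):
        super().__init__()
        self.cell = nn.Embedding(n_cells, d)
        self.assay = nn.Embedding(n_assays, d)
        self.val = nn.Linear(1, d)
        self.attn = nn.MultiheadAttention(d, num_heads=4, batch_first=True)
        self.out = nn.Sequential(nn.Linear(d, d), nn.ReLU(), nn.Linear(d, 1))

    def forward(self, obs_cell, obs_assay, obs_val, tgt_cell, tgt_assay):
        # obs_cell, obs_assay, obs_val: (B, T); tgt_cell, tgt_assay: (B,)
        kv = (self.cell(obs_cell) + self.assay(obs_assay)
              + self.val(obs_val.unsqueeze(-1)))            # observed tracks
        q = (self.cell(tgt_cell) + self.assay(tgt_assay)).unsqueeze(1)
        ctx, _ = self.attn(q, kv, kv)                        # (B, 1, d)
        return self.out(ctx).squeeze(-1).squeeze(-1)         # predicted signal (B,)
```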
DeepMAsED: evaluating the quality of metagenomic assemblies
Motivation: Methodological advances in metagenome assembly are rapidly increasing the number of published metagenome assemblies. However, identifying misassemblies is challenging due to a lack of closely related reference genomes that can act as pseudo ground truth. Existing reference-free methods are no longer maintained, can make strong assumptions that may not hold across a diversity of research projects, and have not been validated on large-scale metagenome assemblies.
Results: We present DeepMAsED, a deep learning approach for identifying misassembled contigs without the need for reference genomes. Moreover, we provide an in silico pipeline for generating large-scale, realistic metagenome assemblies for comprehensive model training and testing. DeepMAsED accuracy substantially exceeds the state-of-the-art when applied to large and complex metagenome assemblies. Our model estimates a 1% contig misassembly rate in two recent large-scale metagenome assembly publications.
Conclusions: DeepMAsED accurately identifies misassemblies in metagenome-assembled contigs from a broad diversity of bacteria and archaea without the need for reference genomes or strong modeling assumptions. Running DeepMAsED is straightforward, as is model re-training with our dataset generation pipeline. Therefore, DeepMAsED is a flexible misassembly classifier that can be applied to a wide range of metagenome assembly projects.
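For intuition, a reference-free misassembly classifier in this spirit might run a 1-D convolutional network over per-position contig features; the sketch below uses illustrative feature channels (e.g., coverage, SNP rate) rather than DeepMAsED's exact inputs:

```python
# Hedged sketch of a reference-free misassembly classifier: a 1-D CNN
# over per-position contig features, pooled over contig length.
import torch
import torch.nn as nn

class MisassemblyNet(nn.Module):
    def __init__(self, n_features=4):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(n_features, 32, kernel_size=9, padding=4), nn.ReLU(),
            nn.Conv1d(32, 64, kernel_size=9, padding=4), nn.ReLU(),
            nn.AdaptiveMaxPool1d(1),          # pool over contig length
        )
        self.head = nn.Sequential(nn.Flatten(), nn.Linear(64, 1))

    def forward(self, x):                      # x: (B, n_features, length)
        return torch.sigmoid(self.head(self.conv(x))).squeeze(-1)

# Example: score a batch of 8 contigs of length 5000
scores = MisassemblyNet()(torch.randn(8, 4, 5000))  # misassembly probabilities
```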